Knowledge Compilation for Itemset Mining
ثبت نشده
چکیده
Mining frequently occurring patterns or itemsets is a fundamental task in datamining. Many ad-hoc itemset mining algorithms have been proposed for enumerating frequent, maximal and closed itemsets. The datamining community has been particularly interested in finding itemsets that satisfy additional constraints, which is a challenging task for existing techniques. In this paper we present a novel approach to itemset mining whereby the set of all itemsets are compiled into a compact form, closely related to binary decision diagrams. While there were previous attempts to utilize decision diagrams for storing the set of frequent itemsets, to the best of our knowledge, this is the first approach that does not rely on backtrack search to generate such a set. Instead, we use a pure knowledge compilation inference mechanism based on pairwise conjunctions of decision diagrams. Our empirical evaluation demonstrates that our approach is complementary to current state-of-the-art approaches. While we do not dominate these approaches, we show that our pure compilation approach can handle instances that remain out of reach for current state-of-the-art itemset miners.
منابع مشابه
A New Algorithm for High Average-utility Itemset Mining
High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...
متن کاملA Survey on Moving Towards Frequent Pattern Growth for Infrequent Weighted Itemset Mining
Data Mining and knowledge discovery is one of the important areas. In this paper we are presenting a survey on various methods for frequent pattern mining. From the past decade, frequent pattern mining plays a very important role but it does not consider the weight factor or value of the items. The very first and basic technique to find the correlation of data is Association Rule Mining. In ARM...
متن کاملAMKIS: An Algorithm for Association Mining
Mining frequent items and itemsets is a daunting task in large databases and has attracted research attention in recent years. Generating specific itemset, K –itemset having K items, is an interesting research problem in data mining and knowledge discovery. In this paper, we propose an algorithm for finding K itemset frequent pattern generation in large databases which is named as AMKIS. AMKIS ...
متن کاملA survey of itemset mining
Itemset mining is an important subfield of data mining, which consists of discovering interesting and useful patterns in transaction databases. The traditional task of frequent itemset mining is to discover groups of items (itemsets) that appear frequently together in transactions made by customers. Although itemset mining was designed for market basket analysis, it can be viewed more generally...
متن کاملContextual Itemset Mining in DBpedia
In this paper we show the potential of contextual itemset mining in the context of Linked Open Data. Contextual itemset mining extracts frequent associations among items considering background information. In the case of Linked Open Data, the background information is represented by an Ontology defined over the data. Each resulting itemset is specific to a particular context and contexts can be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010